Offline Stack Overflow through stackdump (2017-03-14)
- Type:
- Other > Other
- Files:
- 1
- Size:
- 39.46 GB
- Tag(s):
- Stack Overflow stackoverflow stackdump Stack Exchange reference mirror
- Uploaded:
- Mar 19, 2017
- By:
- Pipinpad
(Make sure to click my username to check that this is the latest release) What is this? ============= An offline mirror of Stack Overflow and most of the Stack Exchange sister sites, with questions and answers posted up to 2017-03-14. The search engine is not perfect, but it can be invaluable when you're without internet. A few StackExchange sites such as math.stackexchange.com and some language-related sites are not included in this package, to save on space. You can add them yourself if you need them. See these for how: Stackdump viewer: http://bitbucket.org/samuel.lai/stackdump Stackexchange data: http://archive.org/details/stackexchange Why? ==== You could recreate this package by downloading the stackdump viewer and importing the stackexchange data yourself, but it would take a day or more of work, lots of RAM (> 12GB) and lots of disk space. I made this for myself, and thought I'd save you that effort. Is this legal? ============== Yes. The data is released by Stack Exchange Inc. under cc-by-sa 3.0 which allows to "remix, transform, and build upon the material". The viewer program is "stackdump" by Samuel Lai, and is released under the MIT License (see link above). How do I use this? What do I need? ================================== You need Python (2.5 or later, but not 3.0), Java (6 or later) and 56 GB of disk space. The package is a working Mercurial repository (which also provides a way to check that the code hasn't been altered). You can update the stackdump viewer with a "hg pull" when new versions appear, as well as add and remove sites with manage.sh if you want more, or to free up space. See the stackdump site for instructions how. Linux: ~~~~~~ Run this in the stackdump directory: ./start_solr.sh & ./start_web.sh to start up a webserver on localhost:8080. To shut it down, Ctrl+C the two scripts. Windows: ~~~~~~~~ Launch these two scripts from a shell: Start-Solr.ps1 Start-StackdumpWeb.ps1 to start up a webserver on localhost:8080. To shut them down, Ctrl+C them. You might need to first turn on script-running for PowerShell (run Set-ExecutionPolicy RemoteSigned in C:\Windows\SysWOW64\WindowsPowerShell\v1.0\powershell.exe started as Administrator). You might then need to edit Start-Solr.ps1 and reduce -Xmx2048M to -Xmx1048M (maybe only for 32-bit version of PowerShell? Not sure). After that, starting Start-Solr.ps1 and Start-StackdumpWeb.ps1 in two different C:\Windows\SysWOW64\WindowsPowerShell\v1.0\powershell.exe should put up the webserver. Other OSes: ~~~~~~~~~~~ Should work if you have Python and Java, but I have not tried myself